Adding support for universal NIMs #551

Merged
visheshtanksale merged 1 commit into NVIDIA:main from visheshtanksale:universal-support
Jun 28, 2025

visheshtanksale commented Jun 27, 2025

Sample NIMCache YAML for an NGC endpoint:

apiVersion: apps.nvidia.com/v1alpha1
kind: NIMCache
metadata:
  name: ngc-nim-cache-universal
spec:
  source:
    ngc:
      modelPuller: gitlab-master.nvidia.com:5005/dl/ai-services/microservices/nim-llm/nim-agnostic:e808bad253c75437582cb1c2a92d6929c5c05e01
      pullSecret: ngc-secret-1
      authSecret: ngc-api-secret-1
      modelEndpoint: "ngc://nvidian/nim-llm-dev/meta-llama3-8b-instruct:hf"
  storage:
    pvc:
      create: true
      storageClass: local-path
      size: "50Gi"
      volumeAccessMode: ReadWriteOnce
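
To make the shape of the `modelEndpoint` value concrete, here is a minimal Go sketch that splits it into its parts. It assumes the layout `ngc://<org>/<team>/<model>:<tag>`, which is inferred only from the sample value above; the `NGCModelRef` type and `ParseNGCEndpoint` function are hypothetical helpers for illustration and are not part of the operator's API.

```go
package main

import (
	"fmt"
	"strings"
)

// NGCModelRef is a hypothetical helper type, not an operator API type.
type NGCModelRef struct {
	Org, Team, Model, Tag string
}

// ParseNGCEndpoint splits "ngc://<org>/<team>/<model>:<tag>" into its parts.
// The layout is an assumption inferred from the sample NIMCache above.
func ParseNGCEndpoint(endpoint string) (NGCModelRef, error) {
	const scheme = "ngc://"
	if !strings.HasPrefix(endpoint, scheme) {
		return NGCModelRef{}, fmt.Errorf("endpoint %q does not start with %q", endpoint, scheme)
	}
	// Everything after the first ':' (past the scheme) is treated as the tag.
	path, tag, found := strings.Cut(strings.TrimPrefix(endpoint, scheme), ":")
	if !found || tag == "" {
		return NGCModelRef{}, fmt.Errorf("endpoint %q has no tag", endpoint)
	}
	parts := strings.Split(path, "/")
	if len(parts) != 3 {
		return NGCModelRef{}, fmt.Errorf("endpoint %q: want org/team/model, got %d segments", endpoint, len(parts))
	}
	for _, p := range parts {
		if p == "" {
			return NGCModelRef{}, fmt.Errorf("endpoint %q has an empty path segment", endpoint)
		}
	}
	return NGCModelRef{Org: parts[0], Team: parts[1], Model: parts[2], Tag: tag}, nil
}

func main() {
	ref, err := ParseNGCEndpoint("ngc://nvidian/nim-llm-dev/meta-llama3-8b-instruct:hf")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Printf("%+v\n", ref)
}
```

The sketch rejects values without the `ngc://` prefix or without a tag; whether the operator itself enforces either rule is not stated in this PR description.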

Sample NIMCache YAML for a Hugging Face (HF) endpoint:

apiVersion: apps.nvidia.com/v1alpha1
kind: NIMCache
metadata:
  name: nim-cache-universal
spec:
  source:
    hf:
      endpoint: "https://huggingface.co"
      namespace: "nvidia"
      authSecret: hf-api-secret
      modelPuller: gitlab-master.nvidia.com:5005/dl/ai-services/microservices/nim-llm/nim-agnostic:e808bad253c75437582cb1c2a92d6929c5c05e01
      pullSecret: ngc-secret-1
      modelName: "Llama-3.1-Nemotron-Nano-8B-v1"
  storage:
    pvc:
      create: true
      storageClass: local-path
      size: "50Gi"
      volumeAccessMode: ReadWriteOnce
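
As a rough illustration of how the `hf` fields relate to one another, the Go sketch below joins `endpoint`, `namespace`, and `modelName` into a repository URL. This composition is an assumption based on the usual Hugging Face URL layout and is not taken from the operator's code; `HFSource` and `RepoURL` are hypothetical names.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// HFSource mirrors three fields of the hf source in the sample above.
// It is a hypothetical stand-in, not the operator's API type.
type HFSource struct {
	Endpoint  string
	Namespace string
	ModelName string
}

// RepoURL joins the fields as <endpoint>/<namespace>/<modelName>.
// That layout is an assumption, not confirmed by this PR.
func (s HFSource) RepoURL() (string, error) {
	if s.Endpoint == "" || s.Namespace == "" || s.ModelName == "" {
		return "", errors.New("endpoint, namespace, and modelName must all be set")
	}
	// Trim trailing slashes so the result never contains "//" in the path.
	base := strings.TrimRight(s.Endpoint, "/")
	return base + "/" + s.Namespace + "/" + s.ModelName, nil
}

func main() {
	src := HFSource{
		Endpoint:  "https://huggingface.co",
		Namespace: "nvidia",
		ModelName: "Llama-3.1-Nemotron-Nano-8B-v1",
	}
	u, err := src.RepoURL()
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(u)
}
```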

To Do

  • Add unit test coverage

copy-pr-bot Bot commented Jun 27, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Comment thread api/apps/v1alpha1/nimcache_types.go
Comment thread internal/controller/platform/standalone/nimservice.go
Comment thread api/apps/v1alpha1/nimcache_types.go
Signed-off-by: Vishesh Tanksale <vtanksale@nvidia.com>
visheshtanksale merged commit da381cd into NVIDIA:main on Jun 28, 2025
9 checks passed